XML Source Preparation for Building Data Warehouses
نویسندگان
چکیده
Faced with the high economic competition, today’s enterprises are forced to rely on decision support systems to assist them in the analysis of large data volumes. Traditionally, the analyzed data are mainly issued from the enterprise’s operational information system. However, due to the international nature of the competition, enterprises are increasingly pressed to explore other, external data sources, mainly issued from the web. However, despite the convergence of XML as a standard data format on the web, a main difficulty in exploiting these data is the lack of data warehouse design approaches for XML data sources. The few proposed approaches suppose that the designer must first manually identify the XML element that represents the analyzed fact; such a step requires a high expertise and domain knowledge. The main contribution of this paper is to automate this step.
منابع مشابه
On Data Cleaning In Building XML Data Warehouses
One of the most important aspects in building an XML data warehouse is data cleaning and integration process. This paper presents a detailed methodology for cleaning data and integrating, especially useful for general situations when different-source documents are involved. Both situations whereby the XML documents have an associated XML Schema or they are just independent XML documents are con...
متن کاملA Methodology for Building XML Data Warehouses
Developing a data warehouse for XML documents involves two major processes: one of creating it, by processing XML raw documents into a specified data warehouse repository; and the other of querying it, by applying techniques to better answer users’ queries. This paper focuses on the first part; that is identifying a systematic approach for building a data warehouse of XML documents, specificall...
متن کاملOn Building XML Data Warehouses
Developing a data warehouse for XML documents implies two major processes: one of creating it, by processing XML raw documents into a specified data warehouse repository; and one of querying it, by applying techniques to better answer user’s queries. This paper focuses on the first part; that is identifying a systematic approach for building a data warehouse of XML documents, specifically for t...
متن کاملAutomating conceptual design of web warehouses
Web warehousing plays a key role in providing the managers with up-to-date and comprehensive information about their business domain. On the other hand, since XML is now a standard de facto for the exchange of semi-structured data, integrating XML data into web warehouses is a hot topic. In this paper we propose a semi-automated methodology for conceptual design of web warehouses from XML sourc...
متن کاملWeb data modeling for integration in data warehouses
In a data warehousing process, the data preparation phase is crucial. Mastering this phase allows substantial gains in terms of time and performance when performing a multidimensional analysis or using data mining algorithms. Furthermore, a data warehouse can require external data. The web is a prevalent data source in this context, but the data broadcasted on this medium are very heterogeneous...
متن کامل